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Abstract. As the initial location of the vector data, this paper presents a 
new active contour model, which is used to extract area water bodies from 
remote sensi ng i mages. The new active contour model is added to the i mage 
gravitation potential energy based on object- background gray values, as 
well as to the similarity restriction potential energy based on discrete 
curvature. The purpose of the new model is to improve the constringency 
speed and noise immunity of the existing model, and to avoid noise from 
occurring on the curve point of attraction and disturbance. The new model 
makes thorough use of prior information obtained from vector data, in 
order to adaptively compute the correlative parameters. In addition, the 
extraction precision evaluating model is established based on similarity 
measurement, and the model solving process is determined based on the 
greedy algorithm. Finally, experiments are performed to show the 
feasi bi I i ty and superi ority of the method. 

Keywords: spatial data updating, contour extraction, active contour 
model, similarity restriction, prior information 



1. Introduction 

Both the focus and most difficult point of this study is the automatic 
extraction of geographic information from remote sensing images (those 



involved in this study being visible images), so as to continuously update 
vector data. One of the main directions of study in this field is to extract 
geographic information using an active contour model for example: 
SALMAN (2010), Matthias (2004, 2007, and 2010), HU (2009), Miao 
(2010). The concept of the active contour model was first presented by Kass 
in 1987 Kass (1987); its main advantage is that it integrates image data, 
initial estimation, target contour characteristics and knowledge- based 
restriction conditions into a single characteristic extraction process, which 
is performed during model evolution. However, the largest setbacks of the 
traditional active contour model are that it is very sensitive to initial 
location CHEN (2007), and that it is not able to easily obtain the initial 
location; at the same time, the vector data may be regarded as the initial 
location of the active contour model, and may provide a large amount of 
priori knowledge during the data updating process. Therefore, one common 
method used by researchers is to apply vector data as previously existing 
knowledge in image contour edge extraction HaN (2009) and the integrated 
update model based on images and vector data has been preliminarily 
applied in large-scale engineering practices CHEN (2010). For instance, 
HaN (2009) presented an automatic GIS vector boundary update method 
based on the grouping snake model; Zhang(2007) and Stefan(2008) 
detected and extracted change information of water systems with the active 
contour model based on images and vector data; and Matthias(2007) 
integrated raster and vector data with the traditional snake model. 

The main disadvantages of the above methods are as follows: The noise 
immunity of the model is poor, and the contour control points may easily 
converge to local extreme points, due to the attraction and disturbance 
ori gi nati ng from noi se poi nts, causi ng the f i nal I y extracted area water body 
to vary greatly from the actual contour. The initial vector and image data 
possess different currencies, and differences also exist in their spatial 
locations, shapes, areas and other characteristics; however, due to the fact 
that they express the same ground obj ect entity (such as a water body) , they 
are greatly similar to each other in terms of shape. Consequently, in order 
to retain the similarity between the geometrical shape of the obj ect and the 
contour shape of the vector data, this paper adds similarity potential energy 
to the model so as to extract correct water body i nformation. 

In addition, traditional models only use gradient intensity to construct 
external forces. If the gradient force is a short-range force and the vector 
data are located at a significant distance from the object's edges, this short- 
range force may not be able to "pull" the vector contour to the true edges, 
thus causing the convergence speed to become slow. Therefore, this paper 
designs a long-range image gravitation based on the object-background 
gray value, in order to improve the convergence speed; as the adaptive 



ability and automation degree of traditional models are both quite poor, 
References SALMAN (2010) and HU (2009) applied prior information 
which had been obtained manually. The paper makes thorough use of the 
prior vector data information, and attempts to compute parameters 
adaptively in the model realization process, so as to enhance the 
automation degree. 



2. Traditional active contour model 

The active contour model means movable curves defined in the image 
domain and their movements are controlled by their own internal force as 
well as the external force from the image data. The internal force restrains 
the shapes of the curves and retains curve smoothness and continuity, and 
the external force leads the contour curves to evolve toward the target 
characteristics. Finally, the total energy of the curves reaches the mini mum 
level under the combined balancing effects of the external and internal 
forces, and the contour information of the object may be extracted. The 
mathematical expression of the active contour model is expressed with 
parameters. SupposingvO) = [x(s), y(s)] , s e [0,1] where s represents the 
arc length of the normalized curve, the energy function of the active contour 
model may be expressed as shown below: 

E snake = f W) I v(s) I 2 +/?(*) I v\s) I 2 -r(s) I V(G CT (x, y)*I(x, y)) l 2 ]Js 
(1) 

Where the former two integral items represents internal energy; the first 
derivative represents elastic energy and restrains the continuity of the 
curve; the second derivative represents bending energy and restrains the 
smoothness of the curve; and the thi rd item represents external energy and 
leads the curve to approach the object. I n order to extract the contour, the 
external energy restrains the curve reaching minimum at the image edges, 
and the gradi ent i mages are used for the expressi on. I n the gradi ent i mages, 
V signifies gradient operator, G a *I signifies the convolution of Image / 
and the Gaussian smoothing filter with the standard deviation bei ng <j . The 
purpose of the convolution isto reduce the influence of noise on the results. 
a(s)> /?0)and y{s) represent the various control weighing coefficients. 
The total curve energy expressed in Formula (1) within the whole image 
domain reaches the minimum pursuant to the variation principle, and the 
curvein conformity with conditions is the final result. 

During the actual computing process Formula (1) must be discretized. 
Supposing that there are n points on the curve, i.e. 
v, ,=(.*,., y t )(i = 1,2,..., n) , the first integral item in Formula (1) may be 



discretized as aO')^, - Vi||_ - which is generally substituted with 
(d-lk-VjJ) 2 . Moreover, d signifies the mean distance between the 
poi nts on the curve and its purpose is to ensure that the poi nts on the curve 
distribute uniform y as distantly as possible. The second item may be 
discretized as /?(/) \v M -2v i +v ! ._J and the third item may be discretized 
as y(i)E img (i) , wnere E img (i) signifies the value computed based on 
convolution and gradient at Point * . Finally, Formula (1) may be discretized 
as shown below: 

E mal = I>(0(^-|v, -v ( - 1 ||) 2 +^(0||v I . +I -2v ( . +vj -rm img (i) 
(2) <-=i 



3. Improvement of active contour model 

3.1. Image gravitation potential energy based on object- 
background gray value 

3.1 .1 . Design of image gravitation potential energy 

In regard to traditional models, the gradient force is a short-range force. 
When the initial vector data is located at a significant distance from the 
actual contour, the model may converge to local minimum so that it 
becomes impossible to obtain the true contour and the speed is very slow. 
Therefore, this paper designs a long-range force based on adequate prior 
information of the vector data. As shown in Figure! if the image is divided 
into two areas, namely the object and the background, their respective 
average gray values are Avg a ,Avg h , their standard deviations are 
St d , Std b , where O represents the center of the contour shape of the vector 
data, a,b respectively represent the poi nts on the contour line of the vector 
data, and r a ,r b represent the straight-line distances from a,b and O. 




Figure L I mage gravitation potential energy 



Supposing that the image gravitation potential energy based on the object- 
background gray value is E out (i) at Point v, and E out {i) represents the 
distance energy related to r t , i.e. the distance between v ; and Point O , this 
potential energy must possess following function: when the points are 
located at a significant distance from the object's edges, rapidly pull the 
control points to approach to the true contour. Consequently, the E out (i) 
may be expressed with Formula (3): 

E out (i) = sgn(i)/r? (3) 

where sgn(z') is a sign function. If v i is located within the area and 
sgn(z') = 1 , then r t will increase during the energy minimization, E out (i) will 
decrease, and v i will be forced to move outward, in a manner similar to 
Point a in Figure X on the contrary, if v j is located outside of the area and 
sgn(z') = -l, then both r t and E OM (i) will decrease during the energy 
minimization, and v, will be forced to move inward, similar to Point b in 
Figure X when the points are close to the edges, E out (i) may be ineffective 
and sgn(z') = 0, similar to Point f in Figure 1 During the minimization 
process, the points which are located far from edges, i.e. Points^,/? , may be 
evolved toward edges at a certain step length along the r a ,r b directions. 
Such a tactic produces a convergence speed which is faster than the 
evolution in the 3x3 or 5x5 ranges of the point. If the points oscillate 
around the edges (as Pointsc, d in Figure 1 do), then the evolution of the 
point will immediately cease, and the average location between the two 
osci 1 1 ati on poi nts may be regarded as the f i nal I ocati on of the poi nt. 



3.1 .2. Automatic identification of object-background points based on range 
statistics 

The key point of computing E out (i) is to determine where Point v i is 
located, in the target area, either outside of the target area or at the edges. 
Therefore, the prior information of the area enclosed by the vector contour 
must be utilized adequately, i.e. the quantity proportion of the pixel points 
in the target area enclosed by the vector contour side line must be larger 
than those in the more distant background area; furthermore, the 
distribution of the pixel gray valuein the target area must be more uniform 
than that in the background area, i.e. Std < Std h ; and finally, in regard to 
the water body, the mean gray val ue of the target area must be smal I er than 
that of the background area farther, i.e. Avg < Avg h . Based on the 
aforementioned prior information, this paper lists the steps of 
automaticallydeterminingthelocation area of v,, as described below: 



® Compute the quantity of pixel points n in the area enclosed by the initial 
vector contour side line and count the pixel quantity n q corresponding to 
every gray level q in [0,1. ,255]; 

(D Sort n q in descending order, analyze the ranked n sequence 
n Q ,n x ,...,n t ,...,n 255 , and compute the n t and corresponding gray level t . The 
t may be computed in the following manner: in the n q sequence, 
n t -n M >10(n r _j -n t _ 2 ) , that is, the difference ratio of two adjacent 
sequences must be at least larger than a magnitude, and the t which 
conforms to such condition is the first t emerging in the ranked sequence. 
The purpose of this step is to identify and sum up the gray levels and pixel 
quantities of the target and the background areas enclosed by the vector 
data. In the ranked n q sequence, n Q ,i\,...,n t belong to the target area, and 
n t+1 ,n t+2 ,...,n 255 belong to the background area. 

(§) Compute the mean pixel value Avg and standard deviation Std of the 
target area. M oreover, 



it it t 

A vg„ = X <* n <, 1 Z n « • Std o =SL n i ( Av 8o - q) i E n 9 ■ 

q=0 q=0 \ q=0 q=Q 

® Identify the area where Point v, is located. Supposing the mean gray 
value of the 3x3 or 5x5 range of Point v ; is avg , if avg < Avg a +X l Std a , 
then Point v,. is located in the area; if avg > Avg +l 2 Std , then Point v i is 
located outside of the area; if Avg +\Std <avg < Avg +A 2 Std , then 
Point x i is located close to the area edges. According to the image quality, 
the value of \ is generally between 1 and 2, and the value of X, generally 
ranges between 4 and 6. 1 n this paper, the value of \ is 1 and that of X, is 
4. 

3.2. Geometrical shape restriction based on similarity potential 
energy 

When the traditional active contour model is used to extract the contour 
side line, the background noise will produce a strong attraction and 
disturbance to the contour side line, thus causing extraction failure. After 
comparing the vector data contour with the image contour, it was found 
that their respective contour shapes were roughly similar. In order to 
adequately utilize this prior information and reduce the influence of the 
noise, this paper introduces the similarity potential energy based on 
contour curvature. 

Curvature is a parameter which may be used to describe the bending of a 
part of the curve; it also possesses displacement invariance and rotation 



invariance, and has the characteristic of convex- con cave invariance. 
Therefore, curvature is used to measure the similarity of the curve before 
and after evolution. As shown in Figure 2, the discrete curvature of Point 
v ( . , a point on the discrete c urve, is k j = sgn(/)Aa ( . / As, , w here Aa t signifies 
the angle between vectors v i _ 1 v j and v,.v w , As,. = v i _ x v i +v j v M signifies the 
approxi mate arc length, and sgn(z') signifies the sign function. If Point v,. is 
convex, then sgn(z') is equal to X if it is concave, then sgn(0 is equal to -1 



4 




Figure 2. Discrete curvature 

The similarity potential energy E sim (i) of Point v, on the curve may be 
defined as follows: 

E sim (i) = (k (i)-k(i)) 2 (4) 

where k (i) represents the curvature of Point v i on the initial vector 
contour curve, and k(i) represents the curvature of Point v, on the current 
evolved contour curve. 

Integrating Formulas (2), (3) and (4), the improved active contour model 
may be obtained: 

E l = icc(i)(d-\\ Vi -v w |) 2 +yS(i)|v,. +1 -2v, +v,._ 1 | 2 -y(i)E img (i) + v(i)E <M (i) + v(i)E si Ji) 

where cr(z') and rj(i) represent the corresponding weight coefficients. It is 
possible to extract the extremals of the image edges after Formula (5) 
reaches the mi ni mum wi thi n the whol e i mage domai n. 



3.3. Solution and realization of the model 

3.3.1 . Design of image gravitation potential energy 

There are many methods which may be used to solve the minimization of 
Formula (5), but the greedy algorithm presented by Amini (1990) and 
Williams (1992) isadopted most frequently. When using this algorithm, the 
convergence speed is high and the complexity is low. Although it is difficult 
to ensure global optimization, overall it is quite an effective method. This 
paper also utilizes the greedy algorithm to solve the model. The basic 



principles of the greedy algorithm are as follows: in regard to any point v, 
on the curve, as shown in Figure 3, first determine the Point v min which can 
minimi zetheFormula (5) in the neighborhood (3x3 is the neighborhood, as 
shown in the figure), then move Point v i to Point v min , and process v p v 2 ,... 
successively. After all points on the curve have been processed, an iteration 
may be formed and a new opti mal curve may be generated. 




Figure 3. Search of optimal neighborhood of greedy algorithm 

I n order to improve the convergence speed and pull the vector data contour 
so that it approaches the edges rapidly, this paper proposes a two-stage 
processing tactic: I n the first stage, the image gravitation plays the leading 
role and the rough contour of the object may be obtained. Moreover, in this 
stage the shape keeping is omitted. Thus, the E out (i) only reaches the 
minimum during the search based on the evolution plan shown in Section 
3.11 In the second stage, the active contour model omits the E out (i) (i.e. 
omit the E out (i) in Formula (5)). This search tactic based on the greedy 
algorithm forces the rough contour to gradually approach the true one and 
ensures that the contour remai ns si mi I ar to the ori gi nal shape. 

During the second-stage search, the following tasks must be completed as 
well: ® Normalize all energy items listed in Formula (5). The normalization 
methods of the first and second items of internal and similarity potential 
energy E sim (i) must be the same, i.e. divide their maximums in the 
neighborhood then normalize the results to the intervals [0,1]. As for the 
third item E img (i) , in order to highlight the effect of the gradient force, 
search the maximum maxmag and minimum minmag of each point on the 
curve in the neighborhood (supposing the gradient value of this point is 
mag ), and normalize E img (i) using the formula 
(minmag -mag) /(maxmag -minmag) . If the gradient values in the 
neighborhood are similar, then the condition that minmag =maxmag-5 if 
maxmag - minmag <5 may be made in order to avoid large deviation 
occurring to the gradient force. (D Self-adaptive increase and decrease of 
the quantity of points on the curve. I n order to promote the edge refining 
ability of the model, a new point must be added in the center when the 



distance between two points is larger than the set threshold (which is 
generally set as 0.2 mm). If the distance between two points is smaller than 
the set threshold, or three points are arranged in a straight line, then the 
corresponding points may be deleted. © Self-adaptive determination of 
weight coefficients. I n most references, the weight coefficients a(i) /?(/) 
» y(i) and rj(i) are set manually based on experience, and kept constant 
throughout the entire iterative process. The main issue with this method is 
that the coefficients may not be determined self-adaptively as the location 
and neighborhood of each point. I n fact, when different points move around 
in their neighborhoods to search for the optimal value, the change rate of 
each energy item differs. If the same weight is applied to all points and 
different iterations, this will result in incorrect results. Consequently, this 
paper proposes a self-adaptive weight determination method: in regard to 
any point v, on the curve, when it searches the optimal value in an 
mxm neighborhood, the model may record the energy values 
corresponding to each neighborhood point, then sum up and compute the 
standard deviations of those energy values. The reciprocals of each 
standard deviation are the weight coefficients of the energy items. The 
weight coefficients determined by this method may change self-adaptively 
accordi ng to the I ocati ons of the poi nts on the curve. 



3.3.2. Steps of the algorithm 

Step L Register the vector data and remote sensing images precisely. 
Process the gray values of the images, unify and register the coordinate 
system and map projection of the remote sensing images and vector data, 
and build the mapping relationship between the image coordinates and 
geographi c coordi nates. 

Step 2: Read the data of the vector contour side line of the area water body 
and initialize the model. Compute Avg a and Std as shown in Section 3.1; 
compute k (i) as shown in Section 3.2; compute the contour center of the 
vector sidelines. 

Step 3: First- stage search. Obtain the rough contour of the object as the 
search tactic as shown in Section 3. IX then stop searchi ng. 

Step 4: Begin second-stage search. Implement self-adaptive increase or 
decrease to the point quantity as described above in the section detailing 
the optimization tactic, so as to enhance the refining ability. 

Step 5: Compute all points on the curve and move them to the locations 
with the minimum energies in the neighborhood, including the following 
specific steps: 



® Count from i =lto n (where n represents the quantity of the points on 
the initial vector curve) and compute the energy value E makei corresponding 
to Point i as shown in Formula (5) (after omitting E out (i) ). 

® Count 7 =1 to m (where m represents the neighborhood pixel quantity 
of a single point on the curve. Here the 5x5 range is selected, i.e. m = 25). 
As for Point / on the curve, .compute the respective values of 
( rf -h"Vi|) h + i- 2v ; +v ;-i| E img U) and E sm (j) of its 
neighborhood point, implement normalization in the neighborhood, 
compute the respective values of m energies, and sum up their standard 

deviations. The reciprocals of these standard deviations are the values of 
a(i) , /?0')> HO and ?7(z) Thjs may be computed as follows: 

E j = 5>(0W -h -v y -.||) 2 +^(0||v, +1 -2v . +v._ 1 | 2 - rmmg (j) + Tj(i)E sim (j) 
lf Ej<E snake . ^ E makeJ =Ej and jmin = j Meanwhile, the next 
j'circulation maybe performed unti | aN j circulations are completed. 
© Move the No. i point v,. t ^ v jmin location, i.e. v,. =v jmin Process the 
next point v M untj | g || i circulations have been completed. 

The above circulation is the energy minimization process. The energy 
minimal points are searched for within the neighborhood range of each 
point on the curve during every circulation so as to substitute the original 
points. 

Step 6: Sum up the moved point quantity in the second-stage optimization 
process, i.e. the quantity of the points which have locations with minimum 
energies differing from the original ones. 

Step 7: When the quantity of the moved points remains the same or the 
total energy value fluctuates periodically, the computing shall be ceased 
immediately, otherwise, continue to Step 8. An iteration optimization in the 
second stage is completed duri ng Steps 4- 7. 

Step 8: Repeat Steps 4-7. 



4. Precision evaluating model based on similarity 
measurement 

I n order to evaluate the performance and extracting precision of the model, 
the evaluating model is built based on the similarity measurements, and its 
basic goal is to measure the similarity between the extracted and true 
contour curves in the image. The true contour curve may be extracted 
manually. This paper makes measurements based on the shape similarity, 
neighboring degree of spatial location and area similarity. The contour side 



lines extracted using the active contour model are expressed as the set of 
ordered points A = {P i = (x i ,y j )\i = l,2,...,N} , and the true contour curve 
extracted manual I y is expressed as B = {V i = (x t , y t ) I i = 1, 2, M } . 

(1) Shape similarity: first, describe the contour side lines with center 
distance function. In regard to shape A (B similar), S{P t ) signifies the 
distance between P i and the center of contour shape A, thus allowing N 
values of S(P t ) onthecontoursidelinetobeobtained. Normalize the mean 
value of SiPj) , then implement the Fourier transform to the N S(P t ) , as 
shown below: 

s( m ) = — Y S(P) exp C j27rmi \m = 0, 1, .. ., N - 1 (6) 

Nto ' N 

Obtain the coefficient vector S A =[s(l),s(2),...,s(L)] via the former L 
coefficients after the Fourier transform. In general, the value of L is 16. If 
L is too large, the high frequency coefficients of the Fourier transform may 
contain noise. In the same manner, it is possible to obtain the S B 
correspondi ng to the contour shape B. The similarity of shapesAand B may 
be expressed as shown below: 

s 1 =l-IIS A -S fl ll(j 1 e[0,l]) (7) 

(2) Neighboring degree of spatial location: The Hausdorff distance is 
appl i ed to measure the nei ghbori ng degree of the I ocati ons (for the detai I ed 
computing process, refer to Sebastien (2008)). Supposing after 
normalization the Hausdorff distance between A and B is H AB , the 
nei ghbori ng degree of the I ocati on may be expressed as fol I ows: 

s 2 =1-H ab (s 2 g[0,1]) (8) 

(3) Area similarity: Supposing A x is the area of A and B l is the area of B, the 
area si milarity may be expressed as fol I ows: 

s 3 =l-\A l -A 2 \ /maK(A 1 ,A 2 )(s 3 e [0,1]) (9) 

The computing formula of the total similarity sim(A,B) may be obtained 
after integrating Formulas (7), (8) and (9), as shown below: 



sim(A, B) = co x s l + co 2 s 2 + o> 3 s 3 (10) 



Where a\,co 2 ,co^ are corresponding weight coefficients, and in this paper 
their values are all 1/3. Thus, the result precision may be measured 

via^z'm(A,5) . 



5. Experiment and analysis 

5.1. Experimental area and data 

The experimental area is located in an area of Suzhou region which contains 
dense waters. Three perennial lakes, namely^, H Z ,H 3 , were selected for 
use in the experiment. The vector data is the 11000000 data produced in 
the 1990s; the coordi nate system is the 54 coordi nate system; the proj ecti on 
is the conic orthermorphic projection; the currency of the image data is 
December 2010; and the image data is corrected orthophotos. The 
coordinate system is the WGS84 coordinate system and the projection is 
Mercator projection with a resolution of 10 m in both the horizontal and 
longitudinal directions, with a longitude range of 120°~121°3o' and latitude 
range of 30°~31°. The total i mage size i s 14485x11391 pi xels. 

5.2. Experimental results and analysis 

5.2.1. Experimental results 

First the gray values of the image data were processed, and then converted 
from the WGS84 coordinate system to the 54 coordinate system, and from 
Mercator to conic orthermorphic projection. The vectors were overlaid with 
images, and the contour side lines of the lakes were extracted and updated 
as described in the realization steps shown in Section 3.3. Figures4, 5 and 6 
show the respective results of H l ,H 2 ,H 3 . N represents the iterations in the 
second stage: 




^,=213, Srf B =19J.s,=0_86, s, =0.90. s.=0.SS. ^=0.94, iV=16 . j, =0.91 . S; =0.%, 

j,=0.S6. 3^=0.72. sim(A,B) =0.S1 sim(A 5} =0.90 Sj =0_95 sim(A, S) =056*-' 



(a) I nitial data (b) Results in first stage (c) Results in second stage 
Figure 4. Experimental results of//, 



A vg,=30.2 , Std, =20 j j 3 =0.71 , j a =0.79 , s. =0.89 , j, =0.83 , A r =3S , j, =0.92 , £, =0.93 , < 
Sj=0.B4j £,=0.77, shs(A5)=0.77 sim{A B) =0.S4 4=#JSf, fl) =0.95 +J 

(a) I nitial data (b) Results in first stage (c) Results in second stage 
Figure 5 Experimental results of ff 2 




A vg, =32 , Sfcf ( =12 j jj =0.90 , i, =0.9 , s ; =0. 39 , =0.93 , N=72, s, =0.96 , s, =053 , «J 
=0.70, £,=0.33, im(|,5)=0J3 saa'E4JB)=tlf9i s,=0.%j ¥m(4, 5) =0.95+J 

(a) I nitial data (b) Results in first stage (c) Results in second stage 

Figure 6 .Experimental results of H 

The graph of precisions corresponding to //, changing with iterations was 
then drawn, and isshown in Figure7(in a manner similar to H 2 and H 3 ): 
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Figure 7. Relationships between precision and iterations Figure 8- 
Relationships between weight and iterations 



In order to analyze the weight changes occurring with the iterations, this 
paper analyzes the weight changes of Points E a(i) , /?(/) , y(i) and t](i) 
in Figure 6 as they occur with the iterations, and the results are shown in 
Figure8. 



5.2.2. Analysis of boundary conditions of the results and model 
After anal yzi ng the results the fol I owi ng concl usi ons were drawn: 

(1) After first-stage processing, most points on the vector contour curve 
approached the edges, and the extraction precision enhanced greatly 
compared with the initial vector data. However, due to the influence of 
noise points or surrounding rivers, the external background points may be 
mistakenly judged as the areas or edges, such as Points A, B, E, F and G 
shown in Figures 4, 5 and 6. These points cause the contour curve to differ 
greatlyfromtheinitial one. 

(2) 1 n the second stage, Points A, B, E, F and G, which previously reached 
local extreme points in the first stage, are forced to A', B', E', F' and G', due 
to the effect of similarity restriction potential energy, and thus the final 
shape of the contour curve remains the same as the initial curve. In 
addition, i n the second stage, the curve approaches the edges slowly, owi ng 
to the effect of short-range force and internal energy, and the precision 
enhances slowly; the model still possesses certain disadvantages over the 
extraction of the weak edges, such as Areas C and D, as shown in Figure 5 
(c). 

(3) 1 1 is clearly known from Figure 7 that the total extraction precision, 
neighboring degree of distance and shape similarity all increase slowly with 
the increase of iterations, while the area precision fluctuates. When the 
number of iterations are more than 10 times, the four precisions tend to 
gradually become consistent. This phenomenon conforms to the evolution 
law of the curve; it is known from Figure 8 that in the initial iterative stage 
the weight coefficients differ greatly. However, with the increase of 
iterations, the weights decrease gradually and tend to become consistent, 
which adequately indicates that the change rates of energy items are 
inconsistent at the beginning of iteration when the points move in the 
neighborhood, whereas when the iterations increase, the change rates tend 
to become consistent. It is therefore shown that when the weights are 
determined based on personal experiences, incorrect results may be 
produced. 



Due to the fact that the active contour model is sensitive to initial location, 
further discussion regarding the initial boundary conditions of the model is 
made below: 

( 1) The area rati o between the area water body i n the i mages end osed by the 
vector data and vector data must be larger than 50%, otherwise the model 
may not be able identify the background and object points automatically, as 
described in the method detailed in Section 3.1 

(2) The vector contour must be very similar to the image contour in shape 
and the two contours must not differ greatly, otherwise the similarity 
restriction potential energy may exhibit a reaction. 

The above two boundary conditions are absolutely necessary. However, the 
present model may not be able to detect the two boundary conditions 
automatically. H uman-computer interaction with the area water body must 
be performed, resulting in poor extraction effect, due to the inconformity 
with the aforementioned two boundary conditions involved in the specific 
application process. 

5.3. Algorithm comparison and analysis 

The H x described in Section 5.1 is selected as the experimental datum to 
compare the differences in results and efficiency between the method 
discussed in this paper and the initial active contour model (the grouping 
snake model presented in Kass(1987) and HAN(2009)). Figures 9 (a) and (b) 
show the results obtained by means of the initial active contour model (the 
greedy algorithm is also adopted as a solving method) and the grouping 
snake model. Table 1 shows the comparison between extraction precision 
and consumed time: 




Figure 9 Extraction results of traditional model and Reference [7] 
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Kass(l987) 


0.83 


0.88 


0.76 


0.82 


2.8s 


HAN(2009) 


0.90 


0.88 


0.92 


0.90 


4.9s 


This paper 


0.97 


0.96 


0.95 


0.96 


3.3s 



Table L Comparison of extraction algorithm 

After compari ng the three methods the f ol I owi ng concl usi ons were drawn : 

(1) 1 1 is difficult for the traditional model to converge to the contour edges 
due to the I i mi tati ons of noi se i nf I uence and sphere of acti on of the gradi ent 
force, and thus the extraction precision is low. 

(2) The grouping snake model shows a great improvement over the 
traditional methods, as it may converge to the contour edges in the area 
with no noise influence. If it is affected by noise, the points on the curve 
may converge to local extreme poi nts easi ly, causi ng the process to requi re 
a I arge amount of ti me. 

In summary, the method raised in this paper is a great improvement in 
noise immunity and extraction precision over the active contour model 
previously used. 



6. Conclusion 

Not only may the existing vector data be used as the initial location of the 
active contour model, but they may also provide a large amount of 
previously existing knowledge during the process of extracting and 
updating the area water body with the active contour model from the 
remote sensing image data. This paper adequately utilizes previously 
existing knowledge, adds similarity restriction potential energy (the 
purpose of which is to avoid the attraction and disturbance of noise points 
to the points on the contour curve and avoid large amounts of deformation 
from occurring between the final curve and actual results) and image 
gravitation potential energy (the purpose of which is to enhance the 
convergence speed of the model) based on the traditional active contour 
model; then the paper adopts a two-stage search tactic during model 
solving, solves the problem of self-adaptive determination of weight 
coefficients, and builds the precision evaluating model based on similarity 
measurement. Finally, the results of experiments show the feasibility and 



superiority of the proposed method. The disadvantages and the aspects 
whi ch requi re further study i ncl ude the f ol I owi ng: 

(1) Further improve the model in order to make it available for the 
extraction of weak edges (i.e. Areas C and D in Figure 5 (c)). 

(2) Settle the matching problem of the image and vector objects not being 
II The paper extracts and updates the area water body using the vector 
data and active contour model, with the one precondition that the vector 
and i mage objects must be 1 1 Only in this way may the vector data be used 
as the initial location of the active contour model. If there are vector data 
but no corresponding objects in the images (10), or if there are objects in 
the images but no vector data (0:1), then one vector datum corresponds to 
multiple image objects or multiple vector objects correspond to one image 
object (IN or N:]), or multiple vector objects correspond to multiple image 
objects (M:N), and the issue which requires further study is how to 
determine and solve the aforementioned situations automatically. 
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